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NOTES ON TIME SERIES ANALYSIS 
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ABSTRACT 

This paper describes a prograjn which was developed 
for analyzing time series quasi-experimental data. The 
computer program outlined generates a non-stationary, 
moving average, time seriec model which indexes the 
instantaneous change in level of the time series due to 
a predeslgnated treatment. The observations Just prior 
to and Just following the treatment are given primary 
weight in the model. 
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SPEC lAL REPORT No-. 722 
COMPUTER-BASED PROJECT for the EVALUATION 
of MEDIA for the HANDICAPPED 

T i t ! 8 ! NOTES ON TIME SERIES ANALYSIS 

BY: Robert Pruzek & Jack H. Bond 
BACKGROUND = 

-'■^ 

The Computer Based Project for the Evaluation of Media for the Handicapped, 
based on contract ftCEC-9-423617-4357 (616) between the Syracuse (N.Y.) City School 
DiGtrict and the Media Services and Captioned Films Branch, Bureau of Education 
for the Handicapped (United States Office of Education) for the five year period 
July 1, 1969 through June 30, 1074. The major goal is to irtprove the instruction 
of handicapped children through the development and use of an evaluation system 
to measure the instructional effectiveness of films and other materials with 
oducable mentally handicapped (EMH) children, in-service training and media support 
for special teachers, and studies related to the evali:istion process and the 
populations used . 

The Project has concentrated on the GOO films and 200 filmstrips from the 
Media Services and Captioned Films (BEH • USCE) depository; however, specific 
packages from Project LIFE, various elementary math curricula, and selected 
programs from Children's TV Workshop have also been evaluated. The evaluation 
model used requires that: 1) objectives of materials be specified and written? 

2) instruments be constructed to test and measure effectiveness? andi 3) children 
be the major sources of evaluation information; A number of instrmonts and 
methodologies are employed in the gathering of cognitive and affective data from 
900 EMH children and 80 special teachers to make the ejffectiveness decisions. 
Over half of the EJIH population can neither read or write; therefore, a unique 
Student Response System (SRS) its employed, consisting of a twenty station G.E. - 
1000 SRS which can be operated in a group or individual recording mode and is 
connected to a remote computer system. The computer capabilities consist of 
remote telephone connections to the Rome (N.Y. ) Air Development Command, the 
Honeywell time-shared network, and the Schenectady (N.Y.) G E Research and 
Development Center; and batch mode capabilities of the Syracuse City Schools, 
Syracuse University, and various commercial sources. 

In-service and media support activities provide on-the-job training for 
teachers, teacher aides, equiftnent, and materials to the special teachers in 
th^ } city schools i The research activities have centered around Investigations 
andV special problems related to the development of the evaluation model. The 
four major areas considered are: 1) testing effects, 2) captioning effects, 

3) special student characteristics? and, 4) evaluation procedures validation* 

Documentation of the major activities appear in the five annual reports 
and the 600 evaluations prepared on materials used. Staff members v/ere encouraged 
to prepare special reports and the attached paper is one of these. The opinions 
expressed in tliis publication do not necessarily reflect the position or policy 
of the Computer Based Project, the United States office of education, or the 
Syracuse City School District, and no official endorsement by any of the agencies 
should be inferred. 
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Notes on Time Series Analysis 

These notes are to describe a program for analysing time series quasl- 

experfmental data. The method of time series analysis is called on integrated. 

moWng average method, it has been found to be useful In a wide variety of 

Industrial and economic applications and seerns to hold conslderoble potential for 

studying educational data. The primary reference Is George Oox and George TIao - 

" ^ Change J n Level of a Non-stationary time series", Biometr I ka . i965 . p. 181-192. 

The assumptions underling the Box-T?ao method are somewhat d Iff leu; t to describe 

In words In their entirety, but three major points should be made: 

1) There will be an absoice of cycles or cystematlc trends In the orlalnal 
dataacross the entire series. That Is, the data should appear to nu^tuat^ 
more or less randomly around a general line of trend for the entire series 
Sustained arffts fn one direction or another will tend to nva ia?e ?h s 



?! <h^/ original data Is to be free of systematical cycles 
It shou d Show more-or-less randoo, fluctuation around the base line In 
^h.'Ji^il'r*.^" correiogram should not show the damped, cyclic cur v^ 
character Stic of autoregresslve time series. In fact Box-Tao offered 

^ 'argely to avoid major pro^lImso?u?o-^ 

regressive tf me series methods. 

fi^.'^cLrrK*'^'*^'',^^ the d_iiferencer, between successive observations In the 
time series has a "lag-one'' correlation which is large in absolute value 
(near 50 usually) whereas, all higher lag correlations - 2 3 4 etc ' 

L'ch oo^n If''' ^ ^"^-^'^^ Which may'be very 

much open to discussion In the context of a particular experiment.) 

Further general points to 6o made are that the time series observations are 

assumed to have been made at equally spaced i Ime Intervals; and that there Is 

Interest In testing the hypothesis of a possible shift In the level of the 

series which Is associated with the occurance of a particular event called a 

treatment. The Box-Tlao model Is properly descrlbod as a non-stationary. 



Integrated, movfng average time series process* In Its simplest form (not one to 
be described except briefly here), the model says that an observation at any 
particular point t, In the time series, Is a sum of three terms: (!) a fixed but 
unknown location parameter, L^, (2) the product of a quantity gamma ( ) times the 
accumulation of a previous set of i>0 values up to any point t In the series, and 
(3) the alpha ( ^^) value associated with that particular point In time. The 
alphas are taken as Independent random normal deviates, which are assumed to have 
constant variance of sIgma squared for each point of the series. 

We shal I consider a slightly more complex form of the model , In which three 
major parameters are Involved. Two equations may Initially be used to describe 
the model on which the program Is based. The first, which Is equivalent to the 
one in the preceeding paragraph Is that for any one of the observations prior 
to treatment T, any value 

t-J y. 

Zt " L 0<^- + (I) 

where L is the fixed location parameter, Is a randoor (normal ) deviate 

associated with the Jth point In time and Is a parameter Indexing the propor- 
t ion of the assumulated oC s ( ) which tiave been absorbed Into the 

j=:f ^ t-j 

time series Itself i 

The second equation states that for any observation at time t fol lowing 
the predeslgnatod treatment, the observation Is 

Z,. = ^ ^ f) ^ X X^^^C^ . + oC (2) 
^ ^ j=' +-J + 

This equation differs from (1) above only by the addition of the parameter ^ 
(delta) which Indexes an Instantaneous sh I ft I n^ i^^ time series which Is 

due to treatment T. The program, however, Involves a 'Veparameterl zatlon" 



stating that cL ^ M ^ cL +hus allowing for a general drift or slope, 

t-J y t-J 
of the time series — I.e. other than horlzorr:!. 

This has the effect of modifying equation (2) to 

Z = L +.-,5, + (, + ^ (t^D) + y ^ y +v>(^<2a) 
t ^ JM '^^t-J t. 

where the series Is expected to have drifted a total of ( 1 + (t-D) 
units at time t^ The three parameters of Interest then became L, 5* S * 

The Program and its outputs ; 

While there are, In fact, two programs which were written by Gene Glass and 
Thomas Maguire, for Box-T!ao method of times series analysis, the second program 
has been found to yield unl nterpretable results with artificial data sets which 
were Intended to be relatively straight-forward #i their characteristics* We 
shall Ignore the second program until a later date. For now we turn to a 
description of the working computer program. The data are assumed to be gathered 
in a running time series, with a predesignated event T which Is specified to have 
occured during the course of the series. Any Input data are required to be a 
running series of 1^1 + ^2 ^^^^'^vatlons. I shall describe the outputs consecu- 
tively with respect +0 the labelled output listing. 

For artiflciol;data with « 26 and = 9; Output I is simply a listing 
of the input Itself. A table like that of Table lA ought to be Inspected for 
each data set simply to get a general Impression of the nature of the time-trends. 
Output li is the table of product-moment correlation cc'eff I dents of lag 1, 2, ... 
3/4 Nj for the pre-*T. observations; their values are plotted automatical ly In 
Figure 1 1 A. This correlogram ( I lA) should be checked to see that there are no 
systematic trends, or obvious cycles In the time series. In particular, HA 
should not show a convent lona I damped sine curve, which tends to show up when an 



autoregtesslve model Is appropriate, s Table III and figure IIIa are directly 
analogous to II and IIA and the same type of inspection is recotrxaended. 

The outputs (IVj IVA) and (V, VA) are lag correlations for pre - and 
post - treatment d iff ev ence n> In the case of Table IV, the lag-one correlation, 
should be fairly large in abcolute value (often near .SO) and the succeeding 
corrolationf? should tjend to approidrtnte zero. 

The remaining tables and figures require a brief preairble. The Bcx-Tlao 
method is initially baced on a Dayesinn method of analysis which assumed that c 
parameter labelled gcmrca i S ) was initially known from previous research in 
the particular area where the ttnie series data has been gathered. The latter 
part of the 3oK-Tiao paper, however, indicates when this value of gr.mma is 
not known, the sample data may be us^^d to estimate gamiua by generating a 
maxinura likelihood solution taken over all possible values of gaLtjna under the 
assumption that the prior for is rectangular* This is what is done In 

the Gless-K-guire computer programs, The maximum likelihood statistics for 
L, and V- correspond to the row of Table VI where the pocterlor of ^ 
is largest. The delta parameter ^ , or its estlmatOj is ti e prluiary focus 
of analysis since this is the me isure of instantaneous change in level of 
the time scries. In addition to the 200 values of delta hat ( ^ ) which 
are given over the corresponding range of gammas, we also print the 200 
values of rau hat ( /U ) and ^L, These, respectively , estimate the general 
rate of ascent or descent (assuming the slope is fixed) and the overall 
location parameter. 

The raajor focus of interpretation should usually be Figure IVA in 
combination with Table Vl, In particular, the plot of in Figure VIA 
gives immediate graphical evidence of the maximum likelihood value of <:> 
based on the line for 

^IC ^^^J '"^^ artificial data set), 

A y\ 

where the specific values of 7^, and ^ ere printed as well as the 



st.mtl.n-a oiior estlmntes ^ fi. and and the t- statistics £oryJ 



und ^ . The t-vnlues for C are plotted for aM values of ^ in figure VIA 
as the "0" values. The object is to find statistically significant values 
of the Instantaneous effect ^ . It should be noted that ^' rf>preients 
the difforencc b<»(.w<.en two eKponentlal ly welshted means; obsc Imricdiately 
preceding, and following, the event T are weighted more heavily Lhati their 
adjacent (precedino, or foUovjJng) observations. Increasing the number of 
observation? either prior to, or after, the treatm'.'.:t event eerves largely 
to increaae the degrees of freedom for the t-statlstic (which Is Hj + - 3) 
as well as to stabilize the ectiraats j of L, ^4 and ^ . Outputs VII :ind 
VIIA are esscnCiaUy "frills" in that they present no fundamental ly new 
information beyond that given in Table VI. Nevertholess, the visual inpi^.ssion 
of; variability of the Gtf>tistic -q over the 20C values of ^ , as ^jell as 
its confidence interval, may facilitate speedy interpretation end comp.irisons 
of several different data sets. 

The Gtatlstlcs L and /A. m.iy be somewhjit difficult to Interpret, until 
one recognises that they nre not i^traight-forward unv/cighted measures; In 
particular, L Is pvoperlyf thought of as an estimate of overall level of 
the series, with primary emphasis on scores early in the aeries, ondyU is 
an estimator of sJone which is ranre heavily weighted by observations toward 
the end of the series. Nevertheless, the statistics should be reasonably 
comp-irible from )ae data set to arcthor at least when the suras IS^ do 
not differ readically, Th- values for ^ -^^^ ^ .i^Q of^ = 4.82 and 
yV». = .^04 do seem to correcpond crudely to overall Level and slope 
respectively, given the above qualifications. 

To suiDmarizo the major pointc, v/e have a non-stationary, moving average, 
^^t^cs model, computer progr?^.m for which has been genera^od especii^Uy 
tc>^f4l^imrite (\ paranetor^ , which indexes the instantaneous change in 



level of the time series due to a predeslgnoted treatment. The observations 
Just prior to and just following the treatment are given primary weight in 
the model. The method is based on a Bayesian model; a uniform rectangular 
prior distribution is assumed in the program for the unknown parameter gamma* 
A maximum likelihood estimate of that gi^rama is printed and plotted. 

Finally^ it rar.y be useful to note that the above* statistics, together 
with the estimated residual variances (Ist columns of Output VI), may be 
used to make planned (or even post- hoc) comparisons of comparable statistics 
for any comparable sets of time series points, For Instance-, see the chapter 
on planned comparisons in Hays (1963). 
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